PIPE: a protein–protein interaction passage extraction module for BioCreative challenge
نویسندگان
چکیده
Identifying the interactions between proteins mentioned in biomedical literatures is one of the frequently discussed topics of text mining in the life science field. In this article, we propose PIPE, an interaction pattern generation module used in the Collaborative Biocurator Assistant Task at BioCreative V (http://www.biocreative.org/) to capture frequent protein-protein interaction (PPI) patterns within text. We also present an interaction pattern tree (IPT) kernel method that integrates the PPI patterns with convolution tree kernel (CTK) to extract PPIs. Methods were evaluated on LLL, IEPA, HPRD50, AIMed and BioInfer corpora using cross-validation, cross-learning and cross-corpus evaluation. Empirical evaluations demonstrate that our method is effective and outperforms several well-known PPI extraction methods. DATABASE URL.
منابع مشابه
Identifying Protein-Protein interactions in Biomedical publications
The paper describes the approaches and the results of our participation in the protein-protein interaction (PPI) extraction task (sub-tasks 1 to 3) of the BioCreative II challenge. The core of our approach is to analyse the logical forms of those sentences which contain the mentioning of relevant protein names, and to rank the sentences from which the relations where extracted using the class d...
متن کاملHITSZ_CDR System for Disease and Chemical Named Entity Recognition and Relation Extraction
In this paper, an end-to-end machine learning-based system was proposed for the challenge task of chemical and disease named entity recognition (DNER) and chemical-induced diseases (CID) relation extraction in BioCreative V, where DNER includes chemical and disease mention recognition (CDMR) and normalization (CDN). The system consists of six components: a preprocessing module, two individual s...
متن کاملThe eFIP system for text mining of protein interaction networks of phosphorylated proteins
Protein phosphorylation is a central regulatory mechanism in signal transduction involved in most biological processes. Phosphorylation of a protein may lead to activation or repression of its activity, alternative subcellular location and interaction with different binding partners. Extracting this type of information from scientific literature is critical for connecting phosphorylated protein...
متن کاملExtracting Interacting Protein Pairs and Evidence Sentences by using Dependency Parsing and Machine Learning Techniques
The biomedical literature is growing rapidly. This increases the need for developing text mining techniques to automatically extract biologically important information such as protein-protein interactions from free texts. Besides identifying an interaction and the interacting pair of proteins, it is also important to extract from the full text the most relevant sentences describing that interac...
متن کاملBioText Report for the Second BioCreAtIvE Challenge
This report describes the BioText team participation in the Second BioCreAtIvE Challenge. We focused on the Interaction-Article (IAS) and the Interaction-Pair (IPS) Sub-Tasks, which ask for the identification of protein interaction information in abstracts, and the extraction of interacting protein pairs from full text documents, respectively. We identified and normalized protein names and then...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره 2016 شماره
صفحات -
تاریخ انتشار 2016